NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Rectifying Privacy and Efficacy Measurements in Machine Unlearning: A New Inference Attack Perspective

Naderloui, Nima; Yan, Shenao; Wang, Binghui; Fu, Jie; Wang, Wendy Hui; Liu, Weiran; Hong, Yuan (August 2025, USENIX Security)

Free, publicly-accessible full text available August 13, 2026
Local Differentially Private Heavy Hitter Detection in Data Streams with Bounded Memory

https://doi.org/10.1145/3639285

Li, Xiaochen; Liu, Weiran; Lou, Jian; Hong, Yuan; Zhang, Lei; Qin, Zhan; Ren, Kui (March 2024, Proceedings of the ACM on Management of Data)

Top-k frequent items detection is a fundamental task in data stream mining. Many promising solutions are proposed to improve memory efficiency while still maintaining high accuracy for detecting the Top-k items. Despite the memory efficiency concern, the users could suffer from privacy loss if participating in the task without proper protection, since their contributed local data streams may continually leak sensitive individual information. However, most existing works solely focus on addressing either the memory-efficiency problem or the privacy concerns but seldom jointly, which cannot achieve a satisfactory tradeoff between memory efficiency, privacy protection, and detection accuracy. In this paper, we present a novel framework HG-LDP to achieve accurate Top-k item detection at bounded memory expense, while providing rigorous local differential privacy (LDP) protection. Specifically, we identify two key challenges naturally arising in the task, which reveal that directly applying existing LDP techniques will lead to an inferior accuracy-privacy-memory efficiency tradeoff. Therefore, we instantiate three advanced schemes under the framework by designing novel LDP randomization methods, which address the hurdles caused by the large size of the item domain and by the limited space of the memory. We conduct comprehensive experiments on both synthetic and real-world datasets to show that the proposed advanced schemes achieve a superior accuracy-privacy-memory efficiency tradeoff, saving 2300× memory over baseline methods when the item domain size is 41,270. Our code is anonymously open-sourced via the link.
more » « less
Full Text Available
Secure and Efficient Video Inferences with Compressed 3-Dimensional Deep Neural Networks

https://doi.org/10.1145/3714393.3726505

Liu, Bingyu; Arastehfard, Ali; Wang, Rujia; Liu, Weiran; Ba, Zhongjie; Zhou, Shanglin; Hong, Yuan (June 2024, ACM)

Full Text Available
ShapleyFL: Robust Federated Learning Based on Shapley Value

https://doi.org/10.1145/3580305.3599500

Sun, Qiheng; Li, Xiang; Zhang, Jiayao; Xiong, Li; Liu, Weiran; Liu, Jinfei; Qin, Zhan; Ren, Kui (August 2023, KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Full Text Available
OpBoost: a vertical federated tree boosting framework based on order-preserving desensitization

https://doi.org/10.14778/3565816.3565823

Li, Xiaochen; Hu, Yuke; Liu, Weiran; Feng, Hanwen; Peng, Li; Hong, Yuan; Ren, Kui; Qin, Zhan (October 2022, Proceedings of the VLDB Endowment)

Vertical Federated Learning (FL) is a new paradigm that enables users with non-overlapping attributes of the same data samples to jointly train a model without directly sharing the raw data. Nevertheless, recent works show that it's still not sufficient to prevent privacy leakage from the training process or the trained model. This paper focuses on studying the privacy-preserving tree boosting algorithms under the vertical FL. The existing solutions based on cryptography involve heavy computation and communication overhead and are vulnerable to inference attacks. Although the solution based on Local Differential Privacy (LDP) addresses the above problems, it leads to the low accuracy of the trained model. This paper explores to improve the accuracy of the widely deployed tree boosting algorithms satisfying differential privacy under vertical FL. Specifically, we introduce a framework called OpBoost. Three order-preserving desensitization algorithms satisfying a variant of LDP called distance-based LDP (dLDP) are designed to desensitize the training data. In particular, we optimize the dLDP definition and study efficient sampling distributions to further improve the accuracy and efficiency of the proposed algorithms. The proposed algorithms provide a trade-off between the privacy of pairs with large distance and the utility of desensitized values. Comprehensive evaluations show that OpBoost has a better performance on prediction accuracy of trained models compared with existing LDP approaches on reasonable settings. Our code is open source.
more » « less
Full Text Available
Greenhouse-gas induced warming amplification over the Arabian Peninsula with implications for Ethiopian rainfall

https://doi.org/10.1007/s00382-021-05858-x

Cook, Kerry H.; Vizy, Edward K.; Liu, Yang; Liu, Weiran (January 2021, Climate Dynamics)
null (Ed.)
Full Text Available
Seasonal asymmetry of equatorial East African rainfall projections: understanding differences between the response of the long rains and the short rains to increased greenhouse gases

https://doi.org/10.1007/s00382-020-05350-y

Cook, Kerry H.; Fitzpatrick, Rory G.; Liu, Weiran; Vizy, Edward K. (October 2020, Climate Dynamics)
null (Ed.)
Full Text Available
Influence of Indian Ocean SST regionality on the East African short rains

https://doi.org/10.1007/s00382-020-05265-8

Liu, Weiran; Cook, Kerry H.; Vizy, Edward K. (June 2020, Climate Dynamics)

Full Text Available

Search for: All records